Abstract

A search for the lepton flavour violating decay τ± → μ±γ has been performed using 221.4 fb⁻¹
of data collected at an e⁺e⁻ centre-of-mass energy of 10.58 GeV with the BABAR detector
at the PEP-II storage ring. The search has an efficiency of (7.45 ± 0.65)% for an expected
background level of 6.2 ± 0.5 events. In the final sample 4 candidate events are selected. As
there is no evidence for a signal in this data, for this preliminary result we set an upper limit
of B(τ± → μ±γ) < 9 × 10⁻⁸ at 90% CL using the method of Feldman and Cousins.
Decays violating lepton flavour number, if observed, would be among the most theoretically
clean signatures of new physics. The decay τ± → μ±γ is one such process and is expected with
potentially observable rates in supersymmetric models [1,2,3], left-right supersymmetric
models j3] and supersymmetric string unified models [5]. For some ranges of model parameters,
decay rates as high as several parts per million are expected for this decay ^EI, even in
light of the current experimental limit on the related μ± → e±γ decay [B]. The recently
reported non-zero charge-parity asymmetry in b-s radiative penguin processes also suggests
large B(τ± → μ±γ) in grand unified theories with supersymmetry [S]. On the other hand, the
modest extensions to the standard model (SM) incorporating neutrino oscillations predict a
branching ratio many orders of magnitude below experimental accessibility (≈ 10⁻⁴⁰ [5]).
Therefore a discovery of this decay would require new physics and a non-observation of the
decay would place restrictions on the parameters in the theories predicting large branching
ratios. Currently the most stringent limit is B(τ± → μ±γ) < 3.1 × 10⁻⁷ at 90% confidence
level (CL) from the BELLE experiment using 86.3 fb⁻¹ of e⁺e⁻ annihilation data.

The search for τ± → μ±γ decays reported here uses data recorded by the BABAR detector at the
PEP-II asymmetric-energy e⁺e⁻ storage ring operated at the Stanford Linear Accelerator
Center. The data sample consists of an integrated luminosity of L = 205.4 fb⁻¹ recorded at a
centre-of-mass energy (√s) of 10.58 GeV and 16.1 fb⁻¹ recorded at √s = 10.54 GeV. With a
cross section for e⁺e⁻ → τ⁺τ⁻ at the luminosity-weighted √s of σ_ττ = (0.89 ± 0.02) nb QT,
this data sample contains 1.97 × 10⁸ e⁺e⁻ → τ⁺τ⁻ events.
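The quoted number of τ pairs follows from the integrated luminosity times the cross section. A minimal sketch of that arithmetic is shown here; the function and constant names are illustrative, and only the luminosities and the cross section are taken from the text.

```python
# Unit conversion: 1 fb^-1 = 10^6 nb^-1.
FB_INV_TO_NB_INV = 1.0e6


def expected_tau_pairs(luminosities_fb_inv, cross_section_nb):
    """Return the expected number of e+e- -> tau+tau- events.

    luminosities_fb_inv: integrated luminosities in fb^-1 (one per run period).
    cross_section_nb: luminosity-weighted tau-pair cross section in nb.
    """
    total_lumi_nb_inv = sum(luminosities_fb_inv) * FB_INV_TO_NB_INV
    return total_lumi_nb_inv * cross_section_nb


# Values quoted in the text: 205.4 fb^-1 and 16.1 fb^-1, sigma = 0.89 nb.
n_tau_pairs = expected_tau_pairs([205.4, 16.1], 0.89)
print(f"N_tautau = {n_tau_pairs:.3g}")
```

With the quoted inputs this reproduces the 1.97 × 10⁸ τ-pair events used later as N_ττ.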

The BABAR detector is described in detail in Ref. [12]. Charged particles are reconstructed
as tracks with a 5-layer silicon vertex tracker and a 40-layer drift chamber (DCH) inside a
1.5-T superconducting solenoidal magnet. An electromagnetic calorimeter (EMC) consisting of
6580 CsI(Tl) crystals is used to identify electrons and photons. A ring-imaging Cherenkov
detector is used to identify charged hadrons. The flux return (IFR) of the solenoid,
instrumented with resistive plate chambers, is used to identify muons.

The signature of the signal process is the presence of an isolated μ and γ having an
invariant mass consistent with that of the τ (1.777 GeV/c² [13]), the energy of the μ and γ
(E_μγ) equal to √s/2 in the event centre-of-mass (CM) frame, and the characteristics of the
other particles in the event consistent with a SM τ decay. Such events are simulated with
higher-order radiative corrections using KK2f ^T] where one τ decays into μγ with a flat
phase space distribution ^], while the other τ decays according to measured rates ^j
simulated with Tauola [16,17]. The detector response is simulated with GEANT4 [18]. The
simulated events for signal as well as SM background processes |1 II 1 61 1 71 1 9I20| are
then reconstructed in the same manner as data. The dominant backgrounds are from SM
e⁺e⁻ → μ⁺μ⁻(γ) and e⁺e⁻ → τ⁺τ⁻(γ) events.

Events with two or four well reconstructed tracks and zero net charge are selected. The
thrust, calculated with all observed charged and neutral particles, is required to lie
between 0.9 and 0.975 to suppress e⁺e⁻ → qq̄ background with low thrust and e⁺e⁻ → μ⁺μ⁻ and
Bhabha backgrounds with thrust close to unity. In order to ensure the presence of at least
one ν within the acceptance of the detector, the lab-frame polar angle (θ_miss) of the
missing momentum of the event is required to lie within the geometrical acceptance of the
detector (−0.76 < cos θ_miss < 0.92) and the missing CM transverse momentum (p_T^miss) is
required to be significantly above zero: −ln(2 × p_T^miss/√s) < 2 (4) for events with two
(four) tracks.
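The preselection above amounts to a handful of cuts on per-event quantities. The sketch below expresses them as a single predicate; the function name, argument names and the use of strict inequalities at the cut boundaries are assumptions made for illustration, not details given in the text.

```python
import math


def passes_preselection(n_tracks, net_charge, thrust, cos_theta_miss,
                        pt_miss_cm, sqrt_s=10.58):
    """Apply the topology, thrust and missing-momentum requirements.

    pt_miss_cm and sqrt_s are in GeV; cos_theta_miss is in the lab frame.
    """
    # Two or four tracks with zero net charge.
    if n_tracks not in (2, 4) or net_charge != 0:
        return False
    # Thrust window rejects low-thrust qq and near-unity mu-pair/Bhabha events.
    if not (0.9 < thrust < 0.975):
        return False
    # Missing momentum must point into the detector acceptance.
    if not (-0.76 < cos_theta_miss < 0.92):
        return False
    # Missing transverse momentum significantly above zero.
    if pt_miss_cm <= 0.0:
        return False
    max_log = 2.0 if n_tracks == 2 else 4.0
    return -math.log(2.0 * pt_miss_cm / sqrt_s) < max_log


print(passes_preselection(2, 0, 0.93, 0.1, 1.5))
```

The looser requirement for four-track events (4 instead of 2) admits events with smaller missing transverse momentum.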

The signal-side hemisphere, defined with respect to the thrust axis, is required to contain
one track with CM momentum less than 4.5 GeV/c and at least one γ with a CM energy greater
than 200 MeV. The track must be identified as a μ using DCH, EMC and IFR information and the
γ candidate is the one which gives the mass of the μγ system closest to the τ mass. The
resolution of the mass of the μγ system is improved by using a kinematic fit with E_μγ
constrained to √s/2 and by assigning the point of closest approach of the μ track to the
e⁺e⁻ collision axis as the origin of the γ candidate. This energy-constrained mass (m_EC)
and ΔE = E_μγ − √s/2 are independent variables in the absence of initial state radiation
(ISR). The mean and width of the Gaussian core of the m_EC and ΔE distributions for
reconstructed signal events are: ⟨m_EC⟩ = 1777 MeV/c², σ(m_EC) = 9 MeV/c², ⟨ΔE⟩ = −9 MeV,
σ(ΔE) = 45 MeV, where the shift in ΔE to low values comes from the tail induced by ISR.

Potential biases in the analysis are minimized by blinding the data events within a 3σ
ellipse centred around ⟨m_EC⟩ and ⟨ΔE⟩ in the m_EC-ΔE plane until all the optimization and
systematic studies of the selection criteria have been completed.
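A minimal sketch of the elliptical region test follows. It assumes an axis-aligned ellipse whose semi-axes are n times the quoted Gaussian widths, which is a natural reading of the text (m_EC and ΔE are described as independent) but is not stated explicitly; the function names are illustrative.

```python
def in_ellipse(m_ec, delta_e, n_sigma,
               mean_m=1777.0, sigma_m=9.0, mean_de=-9.0, sigma_de=45.0):
    """Return True if (m_EC, dE) lies inside the n_sigma ellipse.

    m_ec is in MeV/c^2 and delta_e in MeV; the default means and widths
    are the signal values quoted in the text.
    """
    dm = (m_ec - mean_m) / sigma_m
    de = (delta_e - mean_de) / sigma_de
    return dm * dm + de * de < n_sigma * n_sigma


def is_blinded(m_ec, delta_e):
    """Events inside the 3 sigma ellipse are hidden until unblinding."""
    return in_ellipse(m_ec, delta_e, 3.0)


print(is_blinded(1800.0, -9.0), in_ellipse(1800.0, -9.0, 2.0))
```

The same predicate with `n_sigma=2.0` describes the smaller ellipse used later as the signal region.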

In order to suppress e⁺e⁻ → μ⁺μ⁻(γ) and e⁺e⁻ → τ⁺τ⁻(γ) events containing energetic final
state radiation and radiation in τ± → μ±νν̄ decays, an isolation criterion is imposed on the
γ, by requiring |cos θ_H| < 0.8, where θ_H is the angle between the μ momentum in the τ rest
frame and the τ momentum as measured in the lab frame. Background contamination arising from
τ → h(≥1)π⁰ν decays with the hadronic track (h) mis-identified as a μ is reduced by
requiring the sum of the CM energy of non-signal photon candidates in the signal-side to be
less than 200 MeV. If the reconstructed neutral particle identified as the signal photon has
at least a 1% likelihood of arising from overlapping daughters in π⁰ → γγ decays, the event
is removed from consideration.

The tag-side hemisphere, which is expected to contain a SM τ decay, is required to have a
total invariant mass less than 1.6 GeV/c² and a CM momentum for each track less than
4.0 GeV/c to reduce background from e⁺e⁻ → qq̄ and e⁺e⁻ → μ⁺μ⁻ processes, respectively. The
qq̄ background is further reduced by requiring the hemisphere to have no more than six photon
candidates.

A tag-side hemisphere containing a single track is classified as e-tag, μ-tag or h-tag if
the total photon CM energy in the hemisphere is no more than 200 MeV and the track is
exclusively identified as an electron (e-tag), as a muon (μ-tag) or as neither (h-tag). If
the total photon CM energy in the hemisphere is more than 200 MeV, then events are selected
if the track is exclusively identified as an electron (eγ-tag) or as neither an electron nor
a muon (hγ-tag). These allow for the presence of radiation in τ → eνν̄ decays and for
photons from π⁰ → γγ in τ → h(≥1)π⁰ν decays. If the tag-side contains three tracks, the
event is classified as a 3h-tag. We explored other tag-side channels but the sensitivity of
the search does not improve by including them.

For the hadronic tagging modes, the invariant mass squared (m_ν²) of the tag-side unobserved
particle (assumed to be a ν_τ) can be well reconstructed assuming that the candidate
τ± → μ±γ decay fully reconstructs the direction of the τ in the tag-side with the energy of
the τ given by √s/2. |m_ν²| is required to be less than 0.4 GeV²/c⁴ for h-tag and 3h-tag
events and less than 0.8 GeV²/c⁴ for hγ-tag events.
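The m_ν² reconstruction can be sketched with simple four-vector arithmetic in the CM frame. This is an illustration under stated assumptions rather than the analysis code: the tag-side τ is taken to be exactly back-to-back with the μγ system, its energy is fixed to √s/2, natural units (c = 1, GeV) are used, and all names are hypothetical.

```python
import math

TAU_MASS = 1.777  # GeV/c^2, value quoted in the text


def neutrino_mass_squared(p_signal, tag_visible, sqrt_s=10.58):
    """Estimate m_nu^2 (GeV^2) of the unobserved tag-side particle.

    p_signal: (px, py, pz) of the mu-gamma system in the CM frame.
    tag_visible: (E, px, py, pz) summed over visible tag-side particles.
    """
    e_tau = sqrt_s / 2.0
    p_tau = math.sqrt(e_tau ** 2 - TAU_MASS ** 2)
    norm = math.sqrt(sum(c * c for c in p_signal))
    # Tag-side tau momentum: opposite to the signal-side direction.
    tau_tag = [-p_tau * c / norm for c in p_signal]
    e_vis, *p_vis = tag_visible
    # Missing four-vector = tag tau minus visible tag-side particles.
    e_nu = e_tau - e_vis
    p_nu = [t - v for t, v in zip(tau_tag, p_vis)]
    return e_nu ** 2 - sum(c * c for c in p_nu)


# Toy check: visible system built so that the missing particle is massless.
p_tau_mag = math.sqrt(5.29 ** 2 - TAU_MASS ** 2)
visible = (5.29 - 0.5, -0.3, 0.0, -p_tau_mag + 0.4)
m_nu_sq = neutrino_mass_squared((0.0, 0.0, 1.0), visible)
print(abs(m_nu_sq) < 0.4)
```

In the toy example the missing particle has energy 0.5 GeV and momentum (0.3, 0, −0.4) GeV, so the reconstructed m_ν² is zero up to rounding and passes the |m_ν²| requirement.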

At this stage of the analysis 15% of the τ± → μ±γ signal events survive within a Grand Side
Band (GSB) region defined as: m_EC ∈ [1.5, 2.1] GeV/c², ΔE ∈ [−1.0, 0.5] GeV. In the
non-blinded parts of the GSB, 4489 events survive in the data which agrees at the 5% level
with the Monte Carlo (MC) background expectation of 4709 events. 80% of the MC events are
e⁺e⁻ → τ⁺τ⁻ events, 82% of which are τ± → μ±νν̄ decays in the signal hemisphere.

To further suppress the backgrounds, separate neural net (NN) based discriminators are
employed for each of the six tag-side channels. Five observables are used as input to the
NN: the missing mass of the event, the CM momentum of the highest momentum tag-side track,
cos θ_H, −ln(2 × p_T^miss/√s) and m_ν². Each NN is trained using data in the non-blinded
part of the GSB to describe the background and τ → μγ MC in the full GSB region to describe
the signal. The NN input distributions of the data are in good agreement with MC backgrounds
both in shape and absolute rates as are the distributions of the NN outputs. The MC is then
used to determine the cut values to be applied to the NN outputs by optimizing on the
expected 90% CL upper limit [23] for observing a signal inside a 2σ ellipse in the m_EC-ΔE
plane centred around ⟨m_EC⟩ and ⟨ΔE⟩.

The CM energy and momentum distributions of the γ and μ candidates before and after the NN
selection has been applied are plotted in Figure 1. These distributions are also well
described by the Monte Carlo simulation in both shape and rates. The two dimensional plot of
m_EC vs ΔE for data is shown in Figure 2 after applying the NN selections.
Figure 1. E_γ^CM and p_μ^CM in GSB regions before and after the NN selection. Data are
represented by points, background MC by hatched histograms (e⁺e⁻ → μ⁺μ⁻ is on top,
e⁺e⁻ → τ⁺τ⁻ below it and e⁺e⁻ → qq̄ on the bottom) and signal MC by dash-lined histograms.



The distributions for m_EC and ΔE before and after applying the NN selection are shown in
Figure 3 where all channels have been combined. It is evident that there is good agreement
between the data and MC in these observables. Because the missing mass is correlated with
ΔE, after the NN selection is applied the remaining events tend to cluster towards low
values of ΔE. Also shown in Figure 3 is the distribution of m_EC for events in
|ΔE − ⟨ΔE⟩| < 3σ(ΔE) and the distribution of ΔE when m_EC is restricted to
|m_EC − ⟨m_EC⟩| < 3σ(m_EC).

The number of background events is estimated after all criteria are applied except that on
m_EC. The estimate uses the non-blinded region inside the band in ΔE:
|ΔE − ⟨ΔE⟩| < 3σ(ΔE), which we refer to as the m_EC sideband. As is evident from Figure 3
these events are fairly uniformly spread in m_EC and we estimate the number of events in the
blinded signal region by scaling the number of events in the sideband by the ratio of the
area of the signal box to that of the sideband.



Figure 2. m_EC vs ΔE for data in all tag-side channels after applying the NN requirements.
The number of events in the data and in the MC samples are quoted. The ellipse depicts the
2σ signal region.



Figure 3. m_EC and ΔE in the full GSB regions before and after the optimal NN requirement is
applied for all tag-side channels combined. Data are represented by points, background MC by
hatched histograms, and signal MC by dash-lined histograms (with arbitrary normalization).
The selected number of events in the data is indicated in the upper right corner. The
vertical lines indicate the 3σ blinded region which is removed for the background
estimation. In the lower left plot events lie within the band |ΔE − ⟨ΔE⟩| < 3σ(ΔE) and in
the lower right plot events lie within the band |m_EC − ⟨m_EC⟩| < 3σ(m_EC).

For the MC in the 3σ blinded region, the background interpolation predicts 11.4 ± 0.9 events
and 9.0 events are selected. This is to be compared with the prediction from data for this
region, which is 14.0 ± 1. These 9.0 MC events correspond to 5.9 e⁺e⁻ → μ⁺μ⁻ events, 2.5
e⁺e⁻ → τ⁺τ⁻ events and 0.7 uds events. The τ± → μ±νν̄ decays account for four of the five
e⁺e⁻ → τ⁺τ⁻ background events observed in the MC. The expectations for the number of
background events in each tag-side channel are in good agreement with the number selected in
the MC. Alternative means for estimating the background using the data yield consistent
expected numbers of background events.
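The sideband scaling used for the background estimate can be sketched as follows. The helper names are illustrative and the sideband count in the first call is a made-up number, since the text does not quote it. The second part assumes a uniform event density, so that the expectation in a 2σ ellipse is the 3σ expectation times the area ratio (2/3)² = 4/9; applied to the quoted 14.0 events this gives about 6.2, which appears consistent with the background level stated for the signal region.

```python
import math


def scale_sideband(n_sideband, area_signal, area_sideband):
    """Scale a sideband count to the signal region by the area ratio.

    Returns (expected events, Poisson uncertainty from the sideband count).
    """
    ratio = area_signal / area_sideband
    return n_sideband * ratio, math.sqrt(n_sideband) * ratio


def ellipse_area(n_sigma, sigma_m=9.0, sigma_de=45.0):
    """Area of an axis-aligned n_sigma ellipse in the m_EC-dE plane (MeV^2)."""
    return math.pi * (n_sigma * sigma_m) * (n_sigma * sigma_de)


# Hypothetical sideband: 100 events in a region 10 times the signal area.
expected, uncertainty = scale_sideband(100, 1.0, 10.0)

# Rescale the quoted 3 sigma expectation (14.0) to the 2 sigma ellipse.
ratio_2_to_3 = ellipse_area(2.0) / ellipse_area(3.0)
expected_2sigma = 14.0 * ratio_2_to_3
print(f"{expected:.1f} +/- {uncertainty:.1f}, 2 sigma: {expected_2sigma:.1f}")
```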

The numbers of events in data and MC in the GSB for the different tag-side channels are
presented in Table 1 both before and after the NN selection. Also included in the table are
the numbers of events in the MC that are selected in the 2σ signal region, the number of
events in the signal region predicted from the MC sideband and the number of events
predicted from the data sideband. There is reasonable agreement between the number of events
observed in the data and those predicted by the MC for all tag-side channels both before and
after the NN selection.

The relative systematic uncertainties on the trigger efficiency, tracking and photon
reconstruction efficiencies, and particle identification are estimated to be 1.2%, 1.3%,
1.6% and 1.2%, respectively. We evaluate an uncertainty on the efficiency arising from the
NN selection by fixing each NN input variable to its average value one at a time, without
changing the architecture of the NN, and re-calculating the efficiency. This has the effect
of removing each input variable completely from the NN selection procedure. This results in
a 1.7% relative variation of the signal efficiency. Adding these errors in quadrature yields
a combined systematic error estimate of 3.2%.

Alternatively, these (and other potential sources of systematic uncertainty not necessarily
accounted for in the above procedure) can be collectively estimated from the uncertainty in
the modelling of the detector by comparing data to the MC backgrounds in the non-blinded
part of the GSB, where the background and signal have similar properties apart from m_EC and
ΔE. The statistical precision of this data-to-MC ratio is augmented by using an expanded
m_EC range ∈ [1.0, 2.5] GeV/c², where one selects 747 and 713 events in data and MC
backgrounds, respectively. The error on this ratio, 1.048 ± 0.055 (stat) ± 0.023 (norm), is
ascribed to the systematic error on efficiency, where the 2.3% normalization error arises
from the error on the product Lσ_ττ.

The tracking and calorimetry systematic errors affect primarily the uncertainties introduced
by applying the final signal region requirement. These systematic effects are studied using
a control sample of e⁺e⁻ → μ⁺μ⁻ events with energetic photons. In order to assess the
uncertainty on the efficiency arising from the mass and energy scale and resolution
systematic errors, the ΔE and m_EC peaks were varied by ±4 MeV and their resolution by
±1 MeV. Adding the deviations induced by these variations in quadrature yields an
uncertainty of 6.3% on the efficiency. In addition, we estimate the uncertainty on the
efficiency associated with the beam energy systematic error to be 0.6%. The total systematic
uncertainty on the efficiency is 8.7%.
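The quadrature sums quoted in the systematic studies can be checked with a few lines. The first combination (five components giving 3.2%) is stated in the text. The second is an assumption: the text does not list which terms enter the 8.7% total, but combining the 5.5% and 2.3% errors on the data-to-MC ratio with the 6.3% and 0.6% terms reproduces the quoted value.

```python
import math


def quadrature_sum(*components):
    """Combine independent relative uncertainties (in percent)."""
    return math.sqrt(sum(c * c for c in components))


# Trigger, tracking, photon reconstruction, particle ID, NN selection.
component_estimate = quadrature_sum(1.2, 1.3, 1.6, 1.2, 1.7)

# Assumed composition of the total: data/MC ratio (stat, norm),
# mass/energy scale and resolution, beam energy.
total_estimate = quadrature_sum(5.5, 2.3, 6.3, 0.6)

# Absolute uncertainty on the 7.45% signal efficiency.
efficiency_error = 7.45 * total_estimate / 100.0

print(f"{component_estimate:.1f}% {total_estimate:.1f}% {efficiency_error:.2f}")
```

Under this assumption the 8.7% relative uncertainty corresponds to an absolute uncertainty of about 0.65 on an efficiency of 7.45%.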

This analysis has an efficiency of (7.45 ± 0.65)% and an expected background, determined
from data estimates of the number of background events in the 2σ signal ellipse, N_bkg, of
6.2 ± 0.5. After unblinding we find 12 events in the blinded region (to be compared with an
expectation of 14.0 ± 1.) and 4 events in the signal region of our data sample. The
probability of 6.2 fluctuating down to 4 events or fewer is 26%. Table 1 lists how these 4
events are distributed across the different tag channels. There is good agreement between
the observations and expectations in all tag channels in the unblinded and signal regions.
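The quoted 26% probability is the cumulative Poisson probability of observing 4 or fewer events for a mean of 6.2. A short check is sketched below; the function name is illustrative and the uncertainty on the background expectation is ignored.

```python
import math


def poisson_cdf(n_max, mean):
    """Probability of observing n_max or fewer events for a Poisson mean."""
    return sum(math.exp(-mean) * mean ** k / math.factorial(k)
               for k in range(n_max + 1))


p_downward = poisson_cdf(4, 6.2)
print(f"P(n <= 4 | 6.2) = {p_downward:.2f}")
```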



We calculate the branching fraction of the τ± → μ±γ decay based on a likelihood function,
which convolutes a Poisson distribution with two Gaussian resolution functions for the
background and the efficiency:



where B denotes the branching fraction of τ → μγ, f = 2 N_ττ ε, b is the expected total
background, μ = ⟨n⟩ = fB + b, n is the number of observed events, and b (f) is sampled from
a normal distribution N(b, σ_b) (N(f, σ_f)). The number of tau pair events N_ττ is
1.97 × 10⁸. The errors on the efficiency and normalization are incorporated in σ_f. This
yields a branching fraction of B(τ± → μ±γ) = (−7.5 ±H) × 10⁻⁸.
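One way to read the description above is as a Poisson probability for n with mean fB + b, averaged over Gaussian-distributed b and f. The sketch below implements that reading by Monte Carlo sampling. It is an assumption about the form of the likelihood, not the expression used in the analysis; in particular the 8.7% relative width for f, the sample count and all names are illustrative. Ignoring the smearing, the Poisson term is maximal at μ = n, which gives the point estimate B = (n − b)/f.

```python
import math
import random

N_TAU_PAIRS = 1.97e8   # from the text
EFFICIENCY = 0.0745    # from the text
REL_SIGMA_F = 0.087    # assumption: total relative uncertainty applied to f
N_OBS = 4              # observed events in the signal region
B_EXP, SIGMA_B = 6.2, 0.5


def poisson_pmf(n, mu):
    """Poisson probability of n events for a strictly positive mean mu."""
    return math.exp(n * math.log(mu) - mu - math.lgamma(n + 1))


def smeared_likelihood(branching, n_samples=20000, seed=1):
    """Average the Poisson term over Gaussian-sampled b and f."""
    rng = random.Random(seed)
    f_nominal = 2.0 * N_TAU_PAIRS * EFFICIENCY
    total = 0.0
    for _ in range(n_samples):
        b = rng.gauss(B_EXP, SIGMA_B)
        f = rng.gauss(f_nominal, REL_SIGMA_F * f_nominal)
        mu = f * branching + b
        if mu > 0.0:  # unphysical (non-positive) means contribute nothing
            total += poisson_pmf(N_OBS, mu)
    return total / n_samples


def point_estimate():
    """Branching fraction at which the unsmeared Poisson mean equals n."""
    return (N_OBS - B_EXP) / (2.0 * N_TAU_PAIRS * EFFICIENCY)


b_hat = point_estimate()
print(f"B = {b_hat:.2e}")
print(smeared_likelihood(b_hat) > smeared_likelihood(2.5e-8))
```

With the quoted inputs the point estimate is close to the central value of −7.5 × 10⁻⁸ given in the text, negative because fewer events are observed than expected from background.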

Since we have no evidence for a signal we have computed an upper limit. Using the method of
Feldman and Cousins [21] the upper limit is 9 × 10⁻⁸ at 90% CL. The systematic errors do not
change the limit for the number of significant figures quoted.
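For illustration, the Feldman-Cousins construction for a Poisson signal with known background can be sketched as below. This is a simplified stand-alone version, not the analysis code: systematic uncertainties are ignored, the signal mean is scanned on a fixed grid, and the grid step, scan range and function names are choices made here. For each trial signal mean s, counts n are ranked by the likelihood ratio P(n | s + b) / P(n | s_best + b), with s_best = max(0, n − b), and added to the acceptance region until it holds at least 90% probability; the upper limit is the largest s whose region contains the observed count.

```python
import math


def poisson_pmf(n, mu):
    """Poisson probability, with the mu = 0 case handled explicitly."""
    if mu == 0.0:
        return 1.0 if n == 0 else 0.0
    return math.exp(n * math.log(mu) - mu - math.lgamma(n + 1))


def fc_acceptance(s, b, cl=0.90, n_max=60):
    """Return the set of counts accepted for signal mean s."""
    ranked = []
    for n in range(n_max + 1):
        p = poisson_pmf(n, s + b)
        best_mean = max(0.0, n - b) + b  # best-fit mean with s >= 0
        ranked.append((p / poisson_pmf(n, best_mean), n, p))
    ranked.sort(reverse=True)  # highest likelihood ratio first
    accepted, total = set(), 0.0
    for _, n, p in ranked:
        accepted.add(n)
        total += p
        if total >= cl:
            break
    return accepted


def fc_upper_limit(n_obs, b, cl=0.90, s_max=15.0, step=0.005):
    """Largest scanned signal mean whose acceptance region contains n_obs."""
    upper = 0.0
    for i in range(int(round(s_max / step)) + 1):
        s = i * step
        if n_obs in fc_acceptance(s, b, cl):
            upper = s
    return upper


# Inputs quoted in the text: 4 observed events, 6.2 expected background.
s_up = fc_upper_limit(4, 6.2)
branching_limit = s_up / (2.0 * 1.97e8 * 0.0745)
print(f"s_up = {s_up:.2f}, B limit = {branching_limit:.1e}")
```

Dividing the limit on the signal mean by f = 2 N_ττ ε converts it to a limit on the branching fraction, which can be compared with the value quoted in the text.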

This preliminary result reduces by more than a factor of three the current upper limit on
the lepton flavour violating decay τ → μγ established by the BELLE Collaboration.